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(64) CI re nt~ server based speech recognition 

(57) A user dictionary, wtifch is formed by storing 
pronunciations and notions of target recognition 
words desisted by the user in eoir«po«teneo with 

each other, Input speech recognition deia, and diction- 
ary management data used to determine the recognition 
field of a recognition dictionary used in recogn&lon of 
the speech recognition data are sent to a server via a 
communication modulo. In the server, a dictionary man* 
agement unit loota up an iden&fter table to determine a 
recognition dictionary corresponding to the dictionary 
management information received from a dient from a 
plurality of kinds of recognition dictionaries, A speech 
recognition modulo recognizes tho speech recognition 
data using at feast {ho o&ertnfeted necognSion diction- 
ary. The recognition result is soot to ma cfiam via a com* 
munfcatlon modukr. 
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Description 

FIELD OF THE INVENTION 

[0001] Tho present invention relates to a client-server 5 
speech recognition system tor recognizing speech Input 
at a client by a server, a speech recognition server, a 
speech recognition client, their control method, and a 
computer readable memory. 

fO 

BACKGROUND OF THE INVENTION 

[0002] In recent years, speech is used as an input In- 
terface in addition to a keyboard, mouse, and the like. 
[0009] However, the recognition rate of speech rec- « 
ognrtion that recognizes input speech lowers and re- 
quires a longer processing time as the number of rec- 
ognition words which are to undergo speech recognition 
becomes larger. For this reason, in an actual method, a 
plurality of recognition dictionaries or lexicons that rag- & 
Ister recognition words (e.g., pronunciations and nota- 
tions) which are to undergo speech recognition are pre- 
pared, and are selectively used (a plurality of recognition 
dictionaries may be used at the same time). 
[0004) Also, unregistered words cannot be recog- 22 
nized. As one of methods for solving this problem, a user 
dictionary or lexicon (prepared by the user to register 
recognition words which are to undergo speech recog- 
nition) may be used. 

[0005] On the other hand, a client-server speech rec- » 
ognltlon system has been studied to implement speech 
recognition on a tormina! with insufficient resources. 
[0006] Thesa three techniques are known to those 
who are skilled in the art, but a system thai combines 
these three techniques has not been realized yet, as 

SUMMARY OF THE INVENTION 

[0007] The present Invention has been made to solve 
the above problems, and has as its object to provide a 40 
speech recognition system which uses a user dictionary 
in response to a user's request in a client-server speech 
recognition system so as to improve speech input effi- 
ciency and to reduce the processing load on the entire 
system, a speech recogn itton server, a speech rocogn !• 
8on client, their control method, and a computer reada- 
ble memory. 

(0008) According to the present Invention, the forego- 
ing object is attained by providing, a client-server 
speech recognition system for recognizing speech input w 
at a client by a server, 

the client comprising: 

speech Input means for Inputting speech; « 
user dictionary holding means tor holding a us- 
er dictionary formed by registering target rec- 
ognition words designated by a user; and 



transmission means for transmuting speech 
data input by said speech input moans, diction- 
ary management information used to deter- 
mine a recognition field of a recognition diction- 
ary used to recognize the speech data, and the 
user dictionary to the server, and 

the server comprising: 

recognition dictionary holding means for hold- 
ing a plurality of kinds of recognition dictionar- 
ies prepared for respective recognition fields; 
determination means for determining one or 
more recognition dictionary corresponding to 
the dictionary management Information re- 
ceived from the client from the plurality of kinds 
of recognition dictionaries; end 
recognition means for recognizing tho speech 
data using at toast tho recognition dictionary 
determined by said determination means. 

[0009] Other features and advantages of the present 
invention will be apparent from the following description 
taken in conjunction with the accompanying drawings, 
in which like reference diameters designate the same 
or similar parts throughout the figures thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] 

Fig. 1 is a block diagram showing the hardware ar- 
rangement of a speech recognition system of the 
first embodiment; 

Fig. 2 is a block diagram showing the functional ar- 
rangement of the speech recognition system of the 
first embodiment; 

Fig. 3 shows the configuration of a user dictionary 
of the first embodiment; 

Fig. 4 shows a speech input window of the first em- 
bodiment; 

Fig. S shows an identifier table of the first embodi- 
ment; 

Flg> 6 is a flow chart showing the process executed 
by the speech recognition systom of the first em- 
bodiment; 

Fig. 7 shows the configuration of a user dictionary 
appended with input form identifiers according to 
the third embodiment; and 
Fig. S shows the configuration of a user dictionary 
appended with recognition dictionary identifiers ac- 
cording to the third embodiment. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

f901 1] Preferred embodiments of the present inven- 
tion wiQ be described int detail below with reference to 
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the accompanying drawings, 
[Rrs! ^libcdimaolj 

[0012] Fig, 1 allows the hardware arrangement of a ^ 
speech recognition system of the first, embodiment. 
£0013] A CPU 101 systematically controls an entire 
client 100. The CPU toi ioads program* stored in a 
ROM 1 02 onto a RAM 1 03, and executes various proc- 
ewss on the basis of the toaded programs. The 80M » 
1U2«tonw venous programs of processes to be exocut- 
ad by the CPU 1 01 . The RAM 103 provides a storage 
area required to execute various prog rams stored In the 
ROM 102. 

£0014} A8oco^iystofa^devicei04stwe»anOS « 
and vflrkxia programs. When the client 100 Is tmpte- 
mented using not a general-p^jKpose apparatus such as 
a personal computer or the ike but a dedicated appara* 
tus, the ROM 102 may store the OS and various pro- 
prams. By toadmg the stored programs onto the HAM SO 
103, the CPU 101 can execute processes. Aa she sec- 
ondary storage device 104, a hard efts* device* nappy 
disk drive, CO-ROM, or fits like may be used. That is, 
storage madia are not pwtfcut*% limited, 
[0O1S] A network t/F (interface} 105 is connected to a 25 
network W 205 or a server 200, 
[001 6 j An mput aavtee 108 comprtaes a mouse, key- 
board, microphone, end the Hke to allow (neat ^various" 
instructions to processes to be e*eetiied by the CPU 
. 101 , and can be used by simultaneously cwjiftecting so 
these plurality of devices. An output device 107 conv 
prises a display {CRT, ICO, or the like), and displays 
lnte*matkin input by the Input davtoe toe, and display 
windows which are controlled by various processes ex- 
ecuted by the CPU 101; A bw 108 Interconnect various aa 
building components of the client 1 0O 
fCOlTj A CPU 201 systematically controls the entire 
server 200. The CPU 201 loads programs stored in a 
ROM 202 onto a RAM 203, and executes various proc- 
esses on the basis of the loaded programs! The ROM 
202 stores various programs of processes to be execut- 
ed by tho CPU 201 . The RAM 203 provfcot a storage 

mquaeU to e«»u& vaijow programs stored in \im ~~ 
ROM202, 

[0O18] A secondary storage device 204 stores an OS ** 
and various programs, When the server 200 Is impie'- 
rnented using not a versata© apparatus such as a per* 
eonaJ compter or the l*e but a dedicated apparatus, 
the ROM 202 may store the OS and various programs . 
By loading the stored programs onto the RAM 203, the 
CPU 201 can execute processes. As the secondary 
storage devjce204, a hard dJskdevice,. floppy disk drive, 

GD*ROM, or trio like may be used. That Is, storage me> 

da are not particularly limited. 

10019] The network W 205 Is canceled to the net- ' ■» 

work Uh 105 of the dbont 100. A bus 206 interconnects 

various bui^c^ components of the server 200, 

[0020] The functional arrangemsnt of the speech rec- 



ognition system or me first errtboriment wll be de- 
scribed below using Fig, 2. 

{0021) Fig, 2 Is a block digram showingthe functional 
amangemem of tho speech recognition system of the 
first embodiment. 

{0022] m the client 100, a speech Input modulo 121 
inputs speech uttered by the user via a microphone (in- 
put device 106), and A/O-convcrts input speech data 
(speech recognition data) which is to undergo speech 
recognition^ A commurik&tions module 1 22 sends a user 
denary 124a, speech recogn*«n data 124b, dxt >on- 
aty management information 124c, and the t&e te the 
server 200. Also, me communication module 122 re* 
ceto a speech recqgriten result of ihs $m speech 
recognition data 1 24b and the like from she server 200, 
[0023] A display module 1 23 displays the speech rec- 
ognition result received from the server 200 while stor- 
ing n in. e.g.. an input term which is displayed on the 
output dcvfco 10? by the process executed by the 
speech recognition system of this embodiment 
(0024) to the server 200, a ccmrountearjon module 
221 recurves the user dictionary 124a, speech recogni* 
ton data 124b, dictionary management information 
1240, and the like from the c&ent too, Abo, ^commu- 
nication module 221 sends the speech reccgmtk>n re- 
sult Ot the speech rocognsion data 124b and the like to 
the client 100. 

t0026] A dictionary management module 223 switch- 
es and selects a plurality of tends of recognition dactfon* 
arias 225 (rBcogs^ipn tfctkjnary 1 to recognition, diction- 
ary N.N: ap^veihleglw) prepared for ro^ctnwiftc- 
ognftion fieida {e.g., for names, addressee, alphanumer- 
ic symbo&t, end the like), and the user dictionary i?4a 
received from the client 100 (may simultaneously use a 
plurality of kind* of c^wanos). . 
[0026] Note that the plummy of kinds of recognition 
dictionaries 225 are prepared tor each dictionary man* 
agemant tntormaiion 124c (Input fomi feJeni&er; to be 
described tetor) sent from the client 100. Each recogni- 
tion dictionary 225 is appended with a recognition do 
fonary Identifier ideating the recognition field of that 

recognition dctfonaty Tho dJc^onaiy manag&nent 
mwiu!e223 m«riw|R« «ri idsrttiHur lafcfc 223a if wl sto? m 
tho recognition dictionary Identifiers and Input form Idon- 
t£iera in correspondence with each other, as shown in 

Rg.«. 

|002T] A speech roccgn&ioa module 224 executes 
speech recogn^km using the recognStloo dictionary or 
dictionaries 225 and user dictionary 1 24a designated for 
speech recognKion by the dictionary matiegement mod- 
u!e223on o» basis of me speech recognition data 124b 
and dictionary management mformalion 1 24c received 

ftomthecsemioa 

10028] Note Chat the user dictionary 124a is prepared 
by the user to register recognition words wmch are to 
undergo speech recognition, and stores pronunciations 
and notations of words to be recognized In correspond- 
ence wan each other, as shown in, e.g.. Fig, 3. 
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[0029] The speech recognition dale 124b may be ei- 
ther speech data A/D-convsrtod by the speech input 

module 121 or data obtained by encoding lhat speech 
data, 

[0030} The dictionary management information 12 4c 
Indicates an Input object and the like, For example, the 
dictionary management information 1 24c Is an identifier 
(input form idcntflcr) indicating the Jype of input form 
when the server 200 recognizes input speech and inputs 
text data corresponding to that speech recognition result 
to each rput term, which defies a speech input window 
dtsptayod by the speech recognlson system of the first 
embodiment as shown in Fig. 4. The client too sends 
this input form tdemtf ier to the server 200 as the dteioih 
ary management information t24c. In the server 200, 
the dictionary management module 223 I cote up the 
identifier table 223a to acquire a recognition dictionary 
identifier corresponding to the received input form iden- 
tifier, and determines a recognition dictionary 225 to bo 
used in speech recognition. 
[0031] The process executed by the speech recogni- 
tion system of tho first embodiment wtf bo explained be- 
low using Fig. 6. 

10032] Fig, e ts a flow chart showing the process ex- 
ecuted by the speech recognition system at tho first orrv- 
bodBmom. 

[0033] in etap si 01, ms client 100 sends the user dic- 
tionary 124a to the server 200. 
[0034] tn stop 8201 , the server 200 recedes the user 
Dictionary 124a from the cfient 100, 
[D035] !h step S1 02, wfceh speech is input to ah input 
form as a target speech input, the cfient 1 00 sends the 
input term idemtfier of that input form to rho raver 2O0 
as the dictionary management formation 124c 
[003$] In step S202, the server200 receives tho input 
form identifier from the cSent 100 as the dictionary mart* 
agement Information 124c. 

[0037] In step S20O, the server 200 loote irp the Idea- 
. tiller table 223a using the dictionary management infor- 
mation 124c to acquire a recognition dictionary identifier 
corresponding to the received input form identifier, and 

rjttejmlnes a rerogniflon dictionary 225 to bo used in 
— ■ - speech tewiyniliun, 

[0038] lnstepSl03jhocllemi00sefKisspoechrec- 
ogrtition data 124b, which is speech-input as text data 
to be Input to each Input form, to the server 200, 
[0039] In step S204, the server 200 receives the 
speech recognition data corresponding to each input 
form from the client f 00. 

[0040] In step $205, the server 200 executes speech 
recognition of the speech reccgnliton data 124b m the 
speech recognition module 224 using the recognition 

rJctSonary 225 and user dictionary 124a designated tor 
speech recognition by the dictionary management mod- 
ulo 223. 

[0041] In the first embodiment, ail recognition words 
contained In the user dictionary t24a sent from the client 
100 to the server 200 are used in speech recognition by 



the speech recognition module 224. 
[0042] tn step 8208, the server 200 sends the speech 
reoognaion result, obtained by the speech recognSlon 
modulo 224 to the client 1 00. 

a (0043] tn step S104, the client 100 receives the 
speech recognition result corresponding to each Input 
form from the server 200, and stores tend data corre- 
sponding to tho speech recognition rosuft in tho corre- 
sponding input form. 

to [0044] The client 100 checKs in step Si OS 8 the 
processing is 16 be ended, II the processing & hoi to Be 
ended (NO in slop 31 06), the Sow returns to stop S102 
to repeat the processing. On the other hand, if the 
p race ssing ts to be ended (YgSnstepS10$]»the cfsent 

» 100 informs the server 200 or end of the processing, and 
ends the processing. 

[0045) ft is checked tn step S207 if a processing end 
instruction from the client 1 00 ja detected. »f no process- 
ing end irwtructJon is detected (NO in seep $207}, the 

M fbwroiurnsbsieoS202torer^ttrreabc^pn«osses. 
On the other hand, if the processing end instruction is 
detected (YES H step 8207), the processing ends. 
(0046] tn the above processing, whan speech is input 
to an input form as a target speech input tho dictionary 

23 management information 1 24coorrasportdsig to thai in- 
put form is sent from tho client 100 to the server 200. 
AltematrVery, the dictionary management WomtailBn 
124c may be sent when the input torn* as a target 
speech input is focused by an instruction from the Input 

so device 106 (the input form as a target speech input is 
determined). 

[00473 *n the server 200, speech recognition Is made 
After nil speech recognition data «>4b flro received. Al- 
ternatively, every time speech ia input as text data to a 

as given input form, that tho portion of speech recognition 
data 124b may be sent to the server 200 f rame by frame 
(for example, one frame is 10 msec speech data), and 
speech recognition may be made hi real time. 
{0048) As described above, according so the fast ern- 

40 bodimont, in the cfierrt -server speech recognition sys- 
tem, since the server 200 executes speech recognaion 

of speech recognition data 1240 using both an appro* 
priHttj reeugrrSion dkiiunaiy 22$ and tire mw dfctiui any 
124a. the speech rocogrtWen pteckloo lathe *orver200 
43 can be improved while reducing the proce ssing load and 
use of storage resources associated with speech rec- 
ognition in the client 100. 

[Second Embodiment] 

no 

(0049) tn the first e/Ttb<xJimejiL if no recognition words 
to be stored in the user dictionary 124a are generated, 

since the user dictionary 124a need not be used, the 
server 200 may use ait recog n tticm words iin the user do 
33 nonary 124a In recognition only when a use request of 
the user dictionary 1 24a ia received from the dieni 1 00. 
[0060] IntWscase.aflsglrritaBt^fr^ 
ary 1 24a is used is added as the dictionary management 
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Infomagon 124c, thus Informing the sewer 200 cf the 
presoncetebaenco of use of the user dictionary 1 24a. 

fThird Embodiment! 

5 

[0051] Sto somo target neco^lion words In the us- 
bt dictionary 124a are not uaod dapendxig on an input 
object, situation, and the Bxe. only specrte recognise* 
words in the user dictionary 124a may be used m rec- 
ognition appending on the Input object end situation. w 
[0U52] Irr Such cMe.wrhMBttOstfdiaM^iyi* man- 
aged by designating rsput form identifiers for respective 
recognition words, as shown In Rg. 7, only recognition 
words having en input form identifier of the input form 
used in speech input can be used in recognition , Alter* « 
natively, a pluraSfy of input form tdenbnera may be des- 
iccated for a given recojjratlonwoni In addition, the us- 
er dictionary may be managed by designating recogni- 
tion dictionary Identifiers In place of input form Identify 
ers, as shown In Rg. ft. 29 



computer, after the program code read out from the stor- 
age medium is written in a memory of the extension 
board or ml When the present invention is appffed to 
tho storage modium, that storage medium stores a pro- 
gram code coxresponcfing to the flow chart shown in Fig. 
3. 

[0056] As many apparently widely diff erent embodi- 
ments of the present invention can be made without do- 
parting from the spirit and scope thereel, & is to be un- 
derstood that the invention t» not tirraed to the specific 
ernDooimenis tnereoj except as aertnea in mo append* 
od claims. 



Ctetms 

1 , A cilem-server speech recognition system for rec- 
ograring speech input at a client by a server, 

Ihe client comprising: 



[Fourth Embodiment] 

[0O53] By combining the second and third embodi- 
ments, the offkaoncy of the speech recognition process 25 
or the speech recognition module 224 can be farmer Im- 
proved. 

(Rah Embodiment! 

[0O54J Mo«t0fihep^cc^SeS0lmde^ra{ua6fmd 
present invention can be implemented by programs. As 
described above, tinea tho Apparatus can uto a gener- 
ai-purpose apparatus such as a personal computet, the 
present Invention Is also achieved by suppling a star- 3s 
age medium, which records a program code of a soft- 
ware program that can Implement tho functions of the 
abc^e^efitiofldd embcd3mema to a system or appara- 
tus, and reading out and executing the program code 
stored in me storage medium by a computer of the sys- 49 
tern or apparatus. In this case, the program code itself 

road out from &to storage medium m*cmcm tho runt- 

lions of 1 he abovo-ntunjfumiti ssnbyUimenta, and lias 
storage medium which stores the program code consti- 
tutes the present invention. As the storage medium few 4S 
supplying the program code, tor example, a floppy disk, 
hard disk, optical disk, magneto-optical disk, CD-ROM. 
magnetic tape, nonvolatile memory card, ROW , and the 
l*o maybe used. 

[0055] The present invention can also be achieved by so 
supplying the storage medium that records the program 
code to a computer, and executing some or all of actual 

processes executed by en OS running on mo computer. 
Furthermore, tne functions of the above-mentioned em- 
bodiments maybe Implemented by somcor all of actual » 
processing operations executed by a fJKJ or the like 
arranged in a function extension board or a function ex- 
tension una, which is inserted in or connected to tho 



speech input means lor inputting speech; 
user dictionary holding means tor holding 
a user dictionary formed by registering tar- 
gat recognition words designated by a us- 
er, and 

transmission means for transmuting 
speech data Input by said speech Input 
means, dictionary management Informa* 
Hen used to determine a recognition field 
of a recognition cscffOrtafy used 10 recOf> 
nlze the speech data, and the user diction- 
ary to tho server, and 

tho server comprising: 

recognition dictionary holding means for 
holding a plurality of kinds of recognition 
dictionaries prepared for respective recog- 
nition fields; 

determination means tor determining one 

or rnoro fccognlUoh dictionary cof respond* 

ing to U w dictionary munaygrnuni iitfqrrna- 
tk»n received from the client from the plu- 
rality of kinds of recognition dictionaries; 
and 

recognition means tor recognizing the 
speech data using at least the recognition 
dictionary detenrtined by said determina- 
tion means. 

2. The system according to claim 1 , wherein said roc- 

ogniton means recognizes tho speech data using 
0>e recagnaion dfctionary determined by sard deter- 
mination means, and tho user cTctie-nary received 
from the cbsnt, 

3. The system according to claim 1 or 2, wherein said 
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speech input means comprises display means for data io£mg at least the recogn&ion dictionary 

dispSaymg an input form as a tergetspeeefj input, determined in the determination step, 
and 

the dle&mary managewwwt information Is an 10. The nsethod according to dafrn 9. wtiereln me rec* 

input farm idantifsor that indicates s type of input s oc^taion step inctadea a step of recognizing the 

form speech data using the recognition dictionary deter- 
mined in the determtnafcor) step, and the user dc- 

4 the system according to any of claims 1 10 3, Bonaiy feceiwed from the client 
wherein the dictionary rnanagement information 

conmlreifftorrh^ to 11„ The method according to daim 9 0? 10, wherein the 

« used in recoghrBdh of the spoecfl dm speech irtou! slop cPmpHses a m**f of d& 

playing an input form as a target spe ech input, and 

5. The system according to any preceding daihx me<§ettonarym&?»gement irrfvormatSon is an 
wherein the wet dlcttonaiy Is fgrmedoy storing pre- input form Ntentier thai IndJwto a type of inpy t 
rtuncia&ons and notations 0? the target recognition « form, 

words in corresponcencowtth each other. 

12. The method according to any of claims 9 to 11, 

6. The system according to cla^ wherein B>e dictionary management information 
dictionary is formed by also storing at least one In- contains Wom*ation Indicating If the user dictionary 
out form ideniSner and the target teceejn&ion words » is used in receonfe of the speech data. 

In correspondence with each ether, 

ia. The method according to any of cMm 8 to 12, 

7. The system according to any preceding daim, wher««n theuserdk^nary»foimed fay atariing pro- 
wherein the user denary Is Tornwd by also stortng nundations and notations of the targ<a rocognmon 
si teas! one c^ recogrirtton dictionary xterriifters in- 25 words m correspondence with each other, 
dieting recognition Holds of tho plurality ot Mnd* of 

feeognison dicitowji'ea, and the target recagn3«an 14. Tha memod according to claim 11, wnorammsuser 

words, dictionary is formed by also storing at least one In- 
put form identifier and the target recognition words 

«. The system according to any preceding dalrn, <» in correspondence with each other, 
wherein tho spootfo 4ata is data Obtained by eneod- 

ing that speech data. 15. The method according to any of claims 0 to 1 4, 

wherein the userdicsiottsn/ is termed by also storing 

0. A method of controlling a cSent-server speech reo- at least one of racognaion dictionary idetrWiere In- . 

ognltton system for recognizing speech input at a as dealing recognition fields ol the plurality of kinds of 

client by a server, comprising: racogr&fcn djctienaaies, and the target reoognrlion 

■ . words, 
a speech input stop of inputting speech ; 

a user dictionary holding step ot holding, in the 16. The method according to any of cfeims & to 1 S, 

client, a user dictionary formed by registering 40 wherein the speech date is data obtained by encod- 

target recognition wordB designated by a user; ing that speech date. 

and ■ 

a .jiuruttifBtori step* Is^iarfiUinjsp^liUala" ~ " 17. AuunvuUs i<mm>tfmmfm^^w^^m 

Input tn the speech input step, pSettenary man* codeofcc^cfaclieRte^ 

agement infom^n used to determ&w too- <S systamfor recognizing speech input at a dlens by a 

ogniiionflektofarec^ server, comprising: 

recognize the speech date, and the user die* 

Nonary to the server? aprogramcocfetf aspe^STp^ 

arect^iiendactjona^ . tingspeech- 

in the server, a pluralfey of Wnds of recognition *e a program code of a user dBcttonary holding 

. dictionaries prepared tor respective recognition stop of hotting, in the cSertt, a user dictionary 

fields; formed by registering target recognition words 

a c&ierminmion step of c&crmimng one or designated try a user; and 

more recognition dictionary corrospond&g to aprogramcodoof a transmission step of tmns* 

the dictionary management Information re- « mltBng speech data fcptft in the speech Input 

cervedtrom the diard tram the pforaJriy of lends step, dictionary management intcreriafcoh used 

of recognilton dictionaries; and to determine a recognition heW ofarecognaion 

a recegnSien stop of racogni2ing die speech dictionary used to recogniio 4h* speech date, 
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and the met CBctlonery to the server ; 
a program code of a recognition dictionary hold* 
ing step of holding, In foe server, a plurality of 
kinds of itjcognmow dictionaries prepared tor 
respective recognition f iolcfa: 
a program codo of a dctenrtfRatfoo step of de* 
tormtning one or more recognition dictionary 
corresponding to the dictionary management 
information received from She client from the 
plurality of klnde of recognition dictionaries; and 
a pttgfarri code 61 a recognrtjdn tfap 01 reccg* 
nizing the speech data using at least the rec- 
ognition dictionary determined in the detem£» 

nation step. 

18. A speech recognition server for recognizing speech 
input at acM* and sending a recognition result to 
the client* comprising: 

rocepCfon means for receMna. from the cfleni 
speech data, olcdonaiy management informa- 
tion used to determine a recognition field of a 
recognition dictionary used to recognize the 
speech data, and a user dictionary foimcd by 
registering target recognition words designated 
by a user; 

recognition dictionary holding means (or hold- 
ing a plurality of kinds of recognition dictionar- 
ies prepared tor respective recognition fields; 
determination means for dte$enranfe*g one or 
more recognition dictionary ecrreipcn&ng to 
the tfctlonary management information re- 
ceived from the dienf tram the piuraSity of kind* 
of recognftiori dictionaries; and 
recognition means tor rccogstlzfcg the speech 
data using a! least the recognition dictionary 
determined by said determination meant. 

1 9. The server according to claim i 8, wherein said rec- 
ognition means recognizes the speech data using 
the recognition dictionary determined by said deter- 

mtoaflon moan** and ttio user cfctlonary received 
hum iho client 

20. TnesejvBracccnfingtoctainte or 19, whersinthe 
speech data la data obtained by encoding that 
speech data. 

21. A speech reeognitlen client for sending input 
speech to be recognized to a server, and receiving 
a recognition resut of that speech, comprising: 

speech input moans for inputting speech; 
user dictionary hotting means tor holding a us- 
er dictionary foimcd by registering target rec- 
ognition words designated by a user; and 
transmlsston means tor transmitting speech 
data input by said speech Input means, diction- 



ary management ^formation used to deter- 
mine a recognition field of a recognition dSctton* 
ary used to recognize fte speech data, and the 
user dictionary to the server. 

3 

22. The client according to claim 21, wherein said 
speech input means comprises display means tor 
displaying an Input form as a target speech Input, 
and 

»fl the dictionary management Information Is an 

input form ttshfi&r thai ihdjeate* a typo of input 
form. 

23. The cfsnt according to cialm 21 or 2 2, wherein the 
» dictionary management Intormatfon conta*» infor- 
mation indicating if the user dictionary b used in rao- 

. og^rionof the speech data, 

24. The dlent according, to any of claims 21 to 23, 
& wherein the user dictionary fs formed by storing pro- 
nunciations and notations of the target recognition 
words in correspondence with each other. 

25. The client according to claim 22, wherein tho usor 
33 dictionary is rbrrned by also storing at least one in- 
put form Identifier and the target recognition words 
in wrreepondenoa wttft each other. 

28. The client according to any of claims 21 to 25, 
so where* the user dictfona^ 

al least Che of recognition^ dictionary icfcrMSiers in- 
dicating recognition f ields of the plurality of lands of 
reeogn&fen dftianarfes, and the Urge! reoognBlon 
words. 

33 

27. The client according to any of claims 21 to 25, 
where** tho speech data is data obtained by encod- 
ing that speech data. 

40 28* A method of controlling a speech recognition server 
for recognizing speech input at actieret, andsending 

a rocognffion result to the diem, cornprlslng: 

a reception step of reooMng, from the cScnt, 
45 speech data, dictionary management Informa- 

tion used to determine a recognition field of a 
recognition dictionary used to recognize the 
speech data, and a reef dictionary formed by 
registering target recognition words designated 
30 by a user; 

a recognition dictionary holding etep of holding 
a plurality of kinds of recognition dictionaries 
prepared for rcspcciivc recognition nclc»; 
a determination step of dctertsjining one or 
33 more recognition dictionary corresponding to 

the dictionary management iratomrtation re- 
ceived from the client from the pj uralfy of Kinds 
of recogntlon dictionaries; and 
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a recognition step of recognizing the speech 
data using al least the recognition dictionary 

determined in &e (teiwrnintatton step. 

29. Tlie method according to dain 28, whore*! the rec- 
ognition stop includes a step of recognizing the 
speech data using the recognition dictionary deter- 
mined in tno determination step, and the user dk> 
tionary received from the cfienL 

30. The method aceordirtg to ciarn 2B Of where* 
the speech data is data obtained by encoding that 
speech data. 

31 . A method of controlling a speech recognition e$cnt 
for Bending input speech to be recognized to a serv- 
er, end receiving a recognition result of that speech, 
comprising? 

a speech input step of inputted speech ; 
a user dictionary holding step of holding a user 
dictionary formed by registering target recogni- 
tion words designated by a user; and 
a transmission step of transmfftlng speech data 
input in the speech input step, dictionary man- 
agement information used to dctemMno a roc* 
ogntaon field or a recognition dictionary used to 
recognize the speech data, and the user dic- 
tionary to mo server. 

92. The method according" 10 claim 31. wherein the 
speech input step comprises a display step of dls- 
piAyfog «n Input term at a t Argot speech Input, and 
the dictionary management information is an 
input form Identifier that indicates a typo of input 
form. 

33. The method according to claim 31 or 32, wherein 
the dictionary management Information contains in- 
formation indicating tf the user dictionary is used in 
recognition of the speech data. 

34. The melhetf auajufaiy to any uf tttam 31 to 33,. 
wherein the user dictionary Is formed by storing pre* 
iwinciabens and notations of the target recognition 
words In correspondence with each other 

35. The method according to dasn 32, wherein the user 
dictionary Is formed by also storing at least one in- 
put form Identifier and the target recognition words 
in correspondancG with each other* 

36. The metnod according to any of datra 31 to 35, 
wherein the user dictionary is formed by also storing 
at least one of recognition dictionary identifiers ift- 
cbcaSng recognition holds ct tho plura&y of kinds of 
recognition dictionaries, and the target recognition 
words. 



37. The meihod according to any of claims 31 to 36, 
wherein the speech data is data obtained by encod* 

irtg tM speech data, 

0 38. A con^xjtorreadabla memory that stores a program 

code of control of a speech recognition server for 
recognizing speech input at a dent, end sandmg a 
recognitton rcsuS to tfio cient, comprising: 

to a program cede of a reception s*sp of 

from the client, speech dm diefidriary rnah- 
agoment information used to determine a rec- 
ognition Held ore rocognjtlon dictionary used to 
recognixe ihe speech da&. and a user ejetien- 
« ary formed by registerteg target recognition 

words designated by a user: 
a program code of a recoc^nton dictionary hold- 
ing step of holding a plurality of kinds of recog- 
nition dictionaries prepared for respective rcc* 
SO oflnltion fields; 

a program code of a determination step of de- 
termining one or more recognition dictionary 
corresponding to the dictionary management 
^formation received from the client from the 
?3 plurality of lands of recognition dictionaries; and 

a program cede of a recognition step of recog- 
nizing the speech data using at: least the rec- 
ognition dictionary determined In the determi- 
nation step. 

30 

99. Acoritputerreadab^ 

cede of control of e speech recognition client for 
tending input spoocn to be rceoani wsd to a server, 
and receiving a recognition result of thai speech, 
as comprising: 

a program code of a speech Input step of input* 
ting speech; 

a program code of a user dictionary holding 
40 step of holding a user dictionary formod by rcg- 

latering target recognition words designated by 

a user; and 

a ptugrien cede of a liansnsssiun yiep uftrHits- 
mftting speech data Input in the speech input 
** step, dictionary management Information used 

to determine a recognition field ot a recognition 
dictionary used to recognize the speech data, 
and the user dictionary to the server, 

*> 40. A client-server speech recognition system for rec- 
ognizing speech Input at a client by a server, 

the client comprising: 

« a speech Input uns inputs speech; 

a user dictionary hokfing a user dictionary 
formed by registering target recognition 
words designated by a user; and 
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a transmitter transmits speech data input 
by said speech input means, dictionary 

management mfarmailon used to deter- 
mine a recognition field of a recognition dic- 
tionary used to recognize tho speech data, a 
and the user dictionary to the server, and 

tfto server comprising:. 



43. Processor imptementable instructions product for 
causing a programmable computer device to cany 
out the melhod of any of clains 2a to 37, when ihe 
Instructions product Is run on said programmable 
computer device. 



a recognition dictionary holding a plura&y to 
o? Kjnccs o? recogroaon otcaonanos pro* 
pared far respective racogrxftjon fiofcfe; 
a determination unit determines one or 
more recognition dKaionary oorrcapondiog 
to the dictionary management information « 
raceved from the client from the plurality 
of kinds of recognition dictionaries; and 
a recognition unit recognizes the speech 
data using at least the recognition diction- 
ary determined by said determination SO 
means. 



41. A speech recognition server fcf recognizing Epasdh 
input at a client and sending a recognition result to 
the client, comprising: 

a recover raee&ea, from the eOsnt, speech da- 
ta, dictionary management information used to 
determine a recognition field of a recognition 
dictionary used to recognize the speech data, 
and a user dfeGonfiry formed by registering W- 
get recognition words designated by a user; 
a raconnsttftft dictionary fielding a plurality of 
lends of recognition dictionaries prepared for 
respective recogn&lcn fields; 
a determination unit determines one or more 
recognition dictionary corresponding to mo die- 
tionflry management information received horn 
the client from the plurality of kinds of recogni- 
tion dictionaries; and 



using at least tho rcoegninon dictionary deter* 
mined uy said determination weans. 

42- A speech recognition client for sending Input 
speech to be recognized to a server, and receiving 
a recognition resut of that speech , comprising; 

a speech input una inputs speech; 
a user dictionary hoidmg a user dictionary 
formed by registering target recognition words 
designated by a user, and 

a transmacf transmits speech daaa input by 
said speech input means, dactionary manage- 
ment information used to determine a recogn> 
bon t«5d of a recognition dfictionary used to rec- 
ognize the speech data, and the user dictionary 
to the s 
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